SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT: Genentech, Inc. 

Hsei, Vanessa 
Koumenis, Iphigenia
Leong, Steven R. 
Presta, Leonard G. 
Shahrokh, Zahra 
Zapata, Gerardo A. 

(ii) TITLE OF INVENTION: Antibody Fragment-Polymer Conjugates and Humanized Anti-IL-8 Monoclonal Antibodies

(iii) NUMBER OF SEQUENCES: 72 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Genentech, Inc. 

(B) STREET: 1 DNA Way 

(C) CITY: South San Francisco 

(D) STATE: California 

(E) COUNTRY: USA 

(F) ZIP: 94080 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 3.5 inch, 1.44 Mb floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: WinPatin (Genentech) 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 20-Feb-1998 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Love, Richard B. 

(B) REGISTRATION NUMBER: 34,659 

(C) REFERENCE/DOCKET NUMBER: P1085R3PCT

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 650/225-5530 

(B) TELEFAX: 650/952-9881

(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:



CAGTCCAACT GTTCAGGACG CC 22 
(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:



GTGCTGCTCA TGCTGTAGGT GC 22 
(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:



GAAGTTGATG TCTTGTGAGT GGC 23 
(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:



GCATCCTAGA GTCACCGAGG AGCC 24 
(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:



CACTGGCTCA GGGAAATAAC CC 22 
(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:



GGAGAGCTGG GAAGGTGTGC AC 22
(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:



ACAAACGCGT ACGCTGACAT CGTCATGACC CAGTC 35
(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 



ACAAACGCGT ACGCTGATAT TGTCATGACT CAGTC 35
(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:



ACAAACGCGT ACGCTGACAT CGTCATGACA CAGTC 35
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 



GCTCTTCGAA TGGTGGGAAG ATGGATACAG TTGGTGC 37
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:



CGATGGGCCC GGATAGACCG ATGGGGCTGT TGTTTTGGC 39 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:



CGATGGGCCC GGATAGACTG ATGGGGCTGT CGTTTTGGC 39
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:



CGATGGGCCC GGATAGACGG ATGGGGCTGT TGTTTTGGC 39
(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 



CGATGGGCCC GGATAGACAG ATGGGGCTGT TGTTTTGGC 39
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



CGATGGGCCC GGATAGACTG ATGGGGCTGT TGTTTTGGC 39 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 369 base pairs

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Double 
(D) TOPOLOGY: Linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

GACATTGTCA TGACACAGTC TCAAAAATTC ATGTCCACAT CAGTAGGAGA 50
CAGGGTCAGC GTCACCTGCA AGGCCAGTCA GAATGTGGGT ACTAATGTAG 100
CCTGGTATCA ACAGAAACCA GGGCAATCTC CTAAAGCACT GATTTACTCG 150
TCATCCTACC GGTACAGTGG AGTCCCTGAT CGCTTCACAG GCAGTGGATC 200
TGGGACAGAT TTCACTCTCA CCATCAGCCA TGTGCAGTCT GAAGACTTGG 250
CAGACTATTT CTGTCAGCAA TATAACATCT ATCCTCTCAC GTTCGGTCCT 300
GGGACCAAGC TGGAGTTGAA ACGGGCTGAT GCTGCACCAC CAACTGTATC 350
CATCTTCCCA CCATTCGAA 369

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 123 amino acids

(B) TYPE: Amino Acid
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

Asp Ile Val Met Thr Gln Ser Gln Lys Phe Met Ser Thr Ser Val
1 5 10 15

Gly Asp Arg Val Ser Val Thr Cys Lys Ala Ser Gln Asn Val Gly
20 25 30

Thr Asn Val Ala Trp Tyr Gln Gln Lys Pro Gly Gln Ser Pro Lys
35 40 45

Ala Leu Ile Tyr Ser Ser Ser Tyr Arg Tyr Ser Gly Val Pro Asp
50 55 60

Arg Phe Thr Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr Ile
65 70 75

Ser His Val Gln Ser Glu Asp Leu Ala Asp Tyr Phe Cys Gln Gln
80 85 90

Tyr Asn Ile Tyr Pro Leu Thr Phe Gly Pro Gly Thr Lys Leu Glu
95 100 105

Leu Lys Arg Ala Asp Ala Ala Pro Pro Thr Val Ser Ile Phe Pro
110 115 120

Pro Phe Glu
123

(2) INFORMATION FOR SEQ ID NO: 18:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 417 base pairs

(B) TYPE: Nucleic Acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

TTCTATTGCT ACAAACGCGT ACGCTGAGGT GCAGCTGGTG GAGTCTGGGG 50
GAGGCTTAGT GCCGCCTGGA GGGTCCCTGA AACTCTCCTG TGCAGCCTCT 100
GGATTCATAT TCAGTAGTTA TGGCATGTCT TGGGTTCGCC AGACTCCAGG 150
CAAGAGCCTG GAGTTGGTCG CAACCATTAA TAATAATGGT GATAGCACCT 200
ATTATCCAGA CAGTGTGAAG GGCCGATTCA CCATCTCCCG AGACAATGCC 250
AAGAACACCC TGTACCTGCA AATGAGCAGT CTGAAGTCTG AGGACACAGC 300
CATGTTTTAC TGTGCAAGAG CCCTCATTAG TTCGGCTACT TGGTTTGGTT 350
ACTGGGGCCA AGGGACTCTG GTCACTGTCT CTGCAGCCAA AACAACAGCC 400
CCATCTGTCT ATCCGGG 417
(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Pro Pro Gly
1 5 10 15

Gly Ser Leu Lys Leu Ser Cys Ala Ala Ser Gly Phe Ile Phe Ser
20 25 30

Ser Tyr Gly Met Ser Trp Val Arg Gln Thr Pro Gly Lys Ser Leu
35 40 45

Glu Leu Val Ala Thr Ile Asn Asn Asn Gly Asp Ser Thr Tyr Tyr
50 55 60

Pro Asp Ser Val Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ala
65 70 75

Lys Asn Thr Leu Tyr Leu Gln Met Ser Ser Leu Lys Ser Glu Asp
80 85 90

Thr Ala Met Phe Tyr Cys Ala Arg Ala Leu Ile Ser Ser Ala Thr
95 100 105

Trp Phe Gly Tyr Trp Gly Gln Gly Thr Leu Val Thr Val Ser Ala
110 115 120

Ala Lys Thr Thr Ala Pro Ser Val Tyr Pro
125 130

(2) INFORMATION FOR SEQ ID NO: 20: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 



ACAAACGCGT ACGCTGATAT CGTCATGACA G 31 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 31 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 



GCAGCATCAG CTCTTCGAAG CTCCAGCTTG G 31 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 



CCACTAGTAC GCAAGTTCAC G 21
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 



GATGGGCCCT TGGTGGAGGC TGCAGAGACA GTG 33
(2) INFORMATION FOR SEQ ID NO: 24:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 714 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 



ATGAAGAAGA ATATCGCATT TCTTCTTGCA TCTATGTTCG TTTTTTCTAT 50
TGCTACAAAC GCGTACGCTG ATATCGTCAT GACACAGTCT CAAAAATTCA 100
TGTCCACATC AGTAGGAGAC AGGGTCAGCG TCACCTGCAA GGCCAGTCAG 150 
AATGTGGGTA CTAATGTAGC CTGGTATCAA CAGAAACCAG GGCAATCTCC 200 
TAAAGCACTG ATTTACTCGT CATCCTACCG GTACAGTGGA GTCCCTGATC 250 
GCTTCACAGG CAGTGGATCT GGGACAGATT TCACTCTCAC CATCAGCCAT 300 
GTGCAGTCTG AAGACTTGGC AGACTATTTC TGTCAGCAAT ATAACATCTA 350 
TCCTCTCACG TTCGGTCCTG GGACCAAGCT GGAGCTTCGA AGAGCTGTGG 400 
CTGCACCATC TGTCTTCATC TTCCCGCCAT CTGATGAGCA GTTGAAATCT 450 
GGAACTGCTT CTGTTGTGTG CCTGCTGAAT AACTTCTATC CCAGAGAGGC 500 
CAAAGTACAG TGGAAGGTGG ATAACGCCCT CCAATCGGGT AACTCCCAGG 550 
AGAGTGTCAC AGAGCAGGAC AGCAAGGACA GCACCTACAG CCTCAGCAGC 600 
ACCCTGACGC TGAGCAAAGC AGACTACGAG AAACACAAAG TCTACGCCTG 650 
CGAAGTCACC CATCAGGGCC TGAGCTCGCC CGTCACAAAG AGCTTCAACA 700 
GGGGAGAGTG TTAA 714 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 237 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Met Lys Lys Asn Ile Ala Phe Leu Leu Ala Ser Met Phe Val Phe
1 5 10 15

Ser Ile Ala Thr Asn Ala Tyr Ala Asp Ile Val Met Thr Gln Ser
20 25 30

Gln Lys Phe Met Ser Thr Ser Val Gly Asp Arg Val Ser Val Thr
35 40 45

Cys Lys Ala Ser Gln Asn Val Gly Thr Asn Val Ala Trp Tyr Gln
50 55 60

Gln Lys Pro Gly Gln Ser Pro Lys Ala Leu Ile Tyr Ser Ser Ser
65 70 75

Tyr Arg Tyr Ser Gly Val Pro Asp Arg Phe Thr Gly Ser Gly Ser
80 85 90

Gly Thr Asp Phe Thr Leu Thr Ile Ser His Val Gln Ser Glu Asp
95 100 105

Leu Ala Asp Tyr Phe Cys Gln Gln Tyr Asn Ile Tyr Pro Leu Thr
110 115 120

Phe Gly Pro Gly Thr Lys Leu Glu Leu Arg Arg Ala Val Ala Ala
125 130 135

Pro Ser Val Phe Ile Phe Pro Pro Ser Asp Glu Gln Leu Lys Ser
140 145 150

Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn Phe Tyr Pro Arg
155 160 165

Glu Ala Lys Val Gln Trp Lys Val Asp Asn Ala Leu Gln Ser Gly
170 175 180

Asn Ser Gln Glu Ser Val Thr Glu Gln Asp Ser Lys Asp Ser Thr
185 190 195

Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu
200 205 210

Lys His Lys Val Tyr Ala Cys Glu Val Thr His Gln Gly Leu Ser
215 220 225

Ser Pro Val Thr Lys Ser Phe Asn Arg Gly Glu Cys
230 235 237

(2) INFORMATION FOR SEQ ID NO: 26:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 756 base pairs

(B) TYPE: Nucleic Acid
(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:

ATGAAAAAGA ATATCGCATT TCTTCTTGCA TCTATGTTCG TTTTTTCTAT 50
TGCTACAAAC GCGTACGCTG AGGTGCAGCT GGTGGAGTCT GGGGGAGGCT 100
TAGTGCCGCC TGGAGGGTCC CTGAAACTCT CCTGTGCAGC CTCTGGATTC 150
ATATTCAGTA GTTATGGCAT GTCTTGGGTT CGCCAGACTC CAGGCAAGAG 200
CCTGGAGTTG GTCGCAACCA TTAATAATAA TGGTGATAGC ACCTATTATC 250
CAGACAGTGT GAAGGGCCGA TTCACCATCT CCCGAGACAA TGCCAAGAAC 300
ACCCTGTACC TGCAAATGAG CAGTCTGAAG TCTGAGGACA CAGCCATGTT 350
TTACTGTGCA AGAGCCCTCA TTAGTTCGGC TACTTGGTTT GGTTACTGGG 400
GCCAAGGGAC TCTGGTCACT GTCTCTGCAG CCTCCACCAA GGGCCCATCG 450
GTCTTCCCCC TGGCACCCTC CTCCAAGAGC ACCTCTGGGG GCACAGCGGC 500
CCTGGGCTGC CTGGTCAAGG ACTACTTCCC CGAACCGGTG ACGGTGTCGT 550
GGAACTCAGG CGCCCTGACC AGCGGCGTGC ACACCTTCCC GGCTGTCCTA 600
CAGTCCTCAG GACTCTACTC CCTCAGCAGC GTGGTGACCG TGCCCTCCAG 650
CAGCTTGGGC ACCCAGACCT ACATCTGCAA CGTGAATCAC AAGCCCAGCA 700
ACACCAAGGT GGACAAGAAA GTTGAGCCCA AATCTTGTGA CAAAACTCAC 750
ACATGA 756

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 251 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Met Lys Lys Asn Ile Ala Phe Leu Leu Ala Ser Met Phe Val Phe
1 5 10 15

Ser Ile Ala Thr Asn Ala Tyr Ala Glu Val Gln Leu Val Glu Ser
20 25 30

Gly Gly Gly Leu Val Pro Pro Gly Gly Ser Leu Lys Leu Ser Cys
35 40 45

Ala Ala Ser Gly Phe Ile Phe Ser Ser Tyr Gly Met Ser Trp Val
50 55 60

Arg Gln Thr Pro Gly Lys Ser Leu Glu Leu Val Ala Thr Ile Asn
65 70 75

Asn Asn Gly Asp Ser Thr Tyr Tyr Pro Asp Ser Val Lys Gly Arg
80 85 90

Phe Thr Ile Ser Arg Asp Asn Ala Lys Asn Thr Leu Tyr Leu Gln
95 100 105

Met Ser Ser Leu Lys Ser Glu Asp Thr Ala Met Phe Tyr Cys Ala
110 115 120

Arg Ala Leu Ile Ser Ser Ala Thr Trp Phe Gly Tyr Trp Gly Gln
125 130 135

Gly Thr Leu Val Thr Val Ser Ala Ala Ser Thr Lys Gly Pro Ser
140 145 150

Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr
155 160 165

Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val
170 175 180

Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr
185 190 195

Phe Pro Ala Val Leu Gln Ser Ser Gly Leu Tyr Ser Leu Ser Ser
200 205 210

Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gln Thr Tyr Ile
215 220 225

Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp Lys Lys
230 235 240

Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr
245 250 251

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 37 base pairs

(B) TYPE: Nucleic Acid

(C) STRANDEDNESS: Single
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:

CCAATGCATA CGCTGACATC GTGATGACCC AGACCCC 37

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 37 base pairs

(B) TYPE: Nucleic Acid

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:

CCAATGCATA CGCTGATATT GTGATGACTC AGACTCC 37
(2) INFORMATION FOR SEQ ID NO: 30:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs

(B) TYPE: Nucleic Acid
(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:

CCAATGCATA CGCTGACATC GTGATGACAC AGACACC 37
(2) INFORMATION FOR SEQ ID NO: 31:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 35 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

AGATGTCAAT TGCTCACTGG ATGGTGGGAA GATGG 35 
(2) INFORMATION FOR SEQ ID NO: 32: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 32 base pairs 
(B) TYPE: Nucleic Acid

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:



CAAACGCGTA CGCTGAGATC CAGCTGCAGC AG 32 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 



CAAACGCGTA CGCTGAGATT CAGCTCCAGC AG 32 
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 391 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 



GATATCGTGA TGACACAGAC ACCACTCTCC CTGCCTGTCA GTCTTGGAGA 50
TCAGGCCTCC ATCTCTTGCA GATCTAGTCA GAGCCTTGTA CACGGTATTG 100
GAAACACCTA TTTACATTGG TACCTGCAGA AGCCAGGCCA GTCTCCAAAG 150
CTCCTGATCT ACAAAGTTTC CAACCGATTT TCTGGGGTCC CAGACAGGTT 200
CAGTGGCAGT GGATCAGGGA CAGATTTCAC ACTCAGGATC AGCAGAGTGG 250
AGGCTGAGGA TCTGGGACTT TATTTCTGCT CTCAAAGTAC ACATGTTCCG 300
CTCACGTTCG GTGCTGGGAC CAAGCTGGAG CTGAAACGGG CTGATGCTGC 350
ACCAACTGTA TCCATCTTCC CACCATCCAG TGAGCAATTG A 391
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 131 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:35: 

Asp Ile Val Met Thr Gln Thr Pro Leu Ser Leu Pro Val Ser Leu
1 5 10 15

Gly Asp Gln Ala Ser Ile Ser Cys Arg Ser Ser Gln Ser Leu Val
20 25 30

His Gly Ile Gly Asn Thr Tyr Leu His Trp Tyr Leu Gln Lys Pro
35 40 45

Gly Gln Ser Pro Lys Leu Leu Ile Tyr Lys Val Ser Asn Arg Phe
50 55 60

Ser Gly Val Pro Asp Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp
65 70 75

Phe Thr Leu Arg Ile Ser Arg Val Glu Ala Glu Asp Leu Gly Leu
80 85 90

Tyr Phe Cys Ser Gln Ser Thr His Val Pro Leu Thr Phe Gly Ala
95 100 105

Gly Thr Lys Leu Glu Leu Lys Arg Ala Asp Ala Ala Pro Thr Val
110 115 120

Ser Ile Phe Pro Pro Ser Ser Glu Gln Leu Lys
125 130 131

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 405 base pairs

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 



GAGATTCAGC TGCAGCAGTC TGGACCTGAG CTGATGAAGC CTGGGGCTTC 50
AGTGAAGATA TCCTGCAAGG CTTCTGGTTA TTCATTCAGT AGCCACTACA 100
TGCACTGGGT GAAGCAGAGC CATGGAAAGA GCCTTGAGTG GATTGGCTAC 150
ATTGATCCTT CCAATGGTGA AACTACTTAC AACCAGAAAT TCAAGGGCAA 200
GGCCACATTG ACTGTAGACA CATCTTCCAG CACAGCCAAC GTGCATCTCA 250
GCAGCCTGAC ATCTGATGAC TCTGCAGTCT ATTTCTGTGC AAGAGGGGAC 300
TATAGATACA ACGGCGACTG GTTTTTCGAT GTCTGGGGCG CAGGGACCAC 350
GGTCACCGTC TCCTCCGCCA AAACCGACAG CCCCATCGGT CTATCCGGGC 400
CCATC 405

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 135 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Glu Ile Gln Leu Gln Gln Ser Gly Pro Glu Leu Met Lys Pro Gly
1 5 10 15

Ala Ser Val Lys Ile Ser Cys Lys Ala Ser Gly Tyr Ser Phe Ser
20 25 30

Ser His Tyr Met His Trp Val Lys Gln Ser His Gly Lys Ser Leu
35 40 45

Glu Trp Ile Gly Tyr Ile Asp Pro Ser Asn Gly Glu Thr Thr Tyr
50 55 60

Asn Gln Lys Phe Lys Gly Lys Ala Thr Leu Thr Val Asp Thr Ser
65 70 75

Ser Ser Thr Ala Asn Val His Leu Ser Ser Leu Thr Ser Asp Asp
80 85 90

Ser Ala Val Tyr Phe Cys Ala Arg Gly Asp Tyr Arg Tyr Asn Gly
95 100 105

Asp Trp Phe Phe Asp Val Trp Gly Ala Gly Thr Thr Val Thr Val
110 115 120

Ser Ser Ala Lys Thr Asp Ser Pro Ile Gly Leu Ser Gly Pro Ile
125 130 135

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 



CTTGGTGGAG GCGGAGGAGA CG 22 
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 



GAAACGGGCT GTTGCTGCAC CAACTGTATT CATCTTCC 38 
(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 



GTCACCGTCT CCTCCGCCTC CACCAAGGGC C 31 
(2) INFORMATION FOR SEQ ID NO: 41:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 729 base pairs

(B) TYPE: Nucleic Acid

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

ATGAAGAAGA ATATCGCATT TCTTCTTGCA TCTATGTTCG TTTTTTCTAT 50
TGCTACAAAT GCATACGCTG ATATCGTGAT GACACAGACA CCACTCTCCC 100
TGCCTGTCAG TCTTGGAGAT CAGGCCTCCA TCTCTTGCAG ATCTAGTCAG 150
AGCCTTGTAC ACGGTATTGG AAACACCTAT TTACATTGGT ACCTGCAGAA 200
GCCAGGCCAG TCTCCAAAGC TCCTGATCTA CAAAGTTTCC AACCGATTTT 250
CTGGGGTCCC AGACAGGTTC AGTGGCAGTG GATCAGGGAC AGATTTCACA 300
CTCAGGATCA GCAGAGTGGA GGCTGAGGAT CTGGGACTTT ATTTCTGCTC 350
TCAAAGTACA CATGTTCCGC TCACGTTCGG TGCTGGGACC AAGCTGGAGC 400
TGAAACGGGC TGTTGCTGCA CCAACTGTAT TCATCTTCCC ACCATCCAGT 450
GAGCAATTGA AATCTGGAAC TGCCTCTGTT GTGTGCCTGC TGAATAACTT 500
CTATCCCAGA GAGGCCAAAG TACAGTGGAA GGTGGATAAC GCCCTCCAAT 550
CGGGTAACTC CCAGGAGAGT GTCACAGAGC AGGACAGCAA GGACAGCACC 600
TACAGCCTCA GCAGCACCCT GACGCTGAGC AAAGCAGACT ACGAGAAACA 650
CAAAGTCTAC GCCTGCGAAG TCACCCATCA GGGCCTGAGC TCGCCCGTCA 700
CAAAGAGCTT CAACAGGGGA GAGTGTTAA 729
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 242 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 

Met Lys Lys Asn Ile Ala Phe Leu Leu Ala Ser Met Phe Val Phe
1 5 10 15

Ser Ile Ala Thr Asn Ala Tyr Ala Asp Ile Val Met Thr Gln Thr
20 25 30

Pro Leu Ser Leu Pro Val Ser Leu Gly Asp Gln Ala Ser Ile Ser
35 40 45

Cys Arg Ser Ser Gln Ser Leu Val His Gly Ile Gly Asn Thr Tyr
50 55 60

Leu His Trp Tyr Leu Gln Lys Pro Gly Gln Ser Pro Lys Leu Leu
65 70 75

Ile Tyr Lys Val Ser Asn Arg Phe Ser Gly Val Pro Asp Arg Phe
80 85 90

Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Arg Ile Ser Arg
95 100 105

Val Glu Ala Glu Asp Leu Gly Leu Tyr Phe Cys Ser Gln Ser Thr
110 115 120

His Val Pro Leu Thr Phe Gly Ala Gly Thr Lys Leu Glu Leu Lys
125 130 135

Arg Ala Val Ala Ala Pro Thr Val Phe Ile Phe Pro Pro Ser Ser
140 145 150

Glu Gln Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn
155 160 165

Asn Phe Tyr Pro Arg Glu Ala Lys Val Gln Trp Lys Val Asp Asn
170 175 180

Ala Leu Gln Ser Gly Asn Ser Gln Glu Ser Val Thr Glu Gln Asp
185 190 195

Ser Lys Asp Ser Thr Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser
200 205 210

Lys Ala Asp Tyr Glu Lys His Lys Val Tyr Ala Cys Glu Val Thr
215 220 225

His Gln Gly Leu Ser Ser Pro Val Thr Lys Ser Phe Asn Arg Gly
230 235 240

Glu Cys
242

(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 762 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Double

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 



ATGAAAAAGA ATATCGCATT TCTTCTTGCA TCTATGTTCG TTTTTTCTAT 50
TGCTACAAAC GCGTACGCTG AGATTCAGCT GCAGCAGTCT GGACCTGAGC 100
TGATGAAGCC TGGGGCTTCA GTGAAGATAT CCTGCAAGGC TTCTGGTTAT 150
TCATTCAGTA GCCACTACAT GCACTGGGTG AAGCAGAGCC ATGGAAAGAG 200
CCTTGAGTGG ATTGGCTACA TTGATCCTTC CAATGGTGAA ACTACTTACA 250
ACCAGAAATT CAAGGGCAAG GCCACATTGA CTGTAGACAC ATCTTCCAGC 300
ACAGCCAACG TGCATCTCAG CAGCCTGACA TCTGATGACT CTGCAGTCTA 350
TTTCTGTGCA AGAGGGGACT ATAGATACAA CGGCGACTGG TTTTTCGATG 400
TCTGGGGCGC AGGGACCACG GTCACCGTCT CCTCCGCCTC CACCAAGGGC 450
CCATCGGTCT TCCCCCTGGC ACCCTCCTCC AAGAGCACCT CTGGGGGCAC 500
AGCGGCCCTG GGCTGCCTGG TCAAGGACTA CTTCCCCGAA CCGGTGACGG 550
TGTCGTGGAA CTCAGGCGCC CTGACCAGCG GCGTGCACAC CTTCCCGGCT 600
GTCCTACAGT CCTCAGGACT CTACTCCCTC AGCAGCGTGG TGACCGTGCC 650
CTCCAGCAGC TTGGGCACCC AGACCTACAT CTGCAACGTG AATCACAAGC 700
CCAGCAACAC CAAGGTGGAC AAGAAAGTTG AGCCCAAATC TTGTGACAAA 750
ACTCACACAT GA 762

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 253 amino acids 
(B) TYPE: Amino Acid

(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44:

Met Lys Lys Asn Ile Ala Phe Leu Leu Ala Ser Met Phe Val Phe
1 5 10 15

Ser Ile Ala Thr Asn Ala Tyr Ala Glu Ile Gln Leu Gln Gln Ser
20 25 30

Gly Pro Glu Leu Met Lys Pro Gly Ala Ser Val Lys Ile Ser Cys
35 40 45

Lys Ala Ser Gly Tyr Ser Phe Ser Ser His Tyr Met His Trp Val
50 55 60

Lys Gln Ser His Gly Lys Ser Leu Glu Trp Ile Gly Tyr Ile Asp
65 70 75

Pro Ser Asn Gly Glu Thr Thr Tyr Asn Gln Lys Phe Lys Gly Lys
80 85 90

Ala Thr Leu Thr Val Asp Thr Ser Ser Ser Thr Ala Asn Val His
95 100 105

Leu Ser Ser Leu Thr Ser Asp Asp Ser Ala Val Tyr Phe Cys Ala
110 115 120

Arg Gly Asp Tyr Arg Tyr Asn Gly Asp Trp Phe Phe Asp Val Trp
125 130 135

Gly Ala Gly Thr Thr Val Thr Val Ser Ser Ala Ser Thr Lys Gly
140 145 150

Pro Ser Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly
155 160 165

Gly Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu
170 175 180

Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val
185 190 195

His Thr Phe Pro Ala Val Leu Gln Ser Ser Gly Leu Tyr Ser Leu
200 205 210

Ser Ser Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gln Thr
215 220 225

Tyr Ile Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp
230 235 240

Lys Lys Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr
245 250 253

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:45: 

Asp Ile Val Met Thr Gln Thr Pro Leu Ser Leu Pro Val Ser Leu
1 5 10 15

Gly Asp Gln Ala Ser Ile Ser Cys Arg Ser Ser Gln Ser Leu Val
20 25 30

His Gly Ile Gly Asn Thr Tyr Leu His Trp Tyr Leu Gln Lys Pro
35 40 45

Gly Gln Ser Pro Lys Leu Leu Ile Tyr Tyr Lys Val Ser Asn Arg
50 55 60

Phe Ser Gly Val Pro Asp Arg Phe Ser Asp Ser Gly Ser Gly Thr
65 70 75

Asp Phe Thr Leu Arg Ile Ser Arg Val Glu Ala Glu Asp Leu Gly
80 85 90

Leu Tyr Phe Cys Ser Gln Ser Thr His Val Pro Leu Thr Phe Gly
95 100 105

Ala Gly Thr Lys Leu Glu Leu Lys Arg
110 114

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 

Asp Ile Gln Met Thr Gln Ser Pro Ser Ser Leu Ser Ala Ser Val
1 5 10 15

Gly Asp Arg Val Thr Ile Thr Cys Arg Ser Ser Gln Ser Leu Val
20 25 30

His Gly Ile Gly Asn Thr Tyr Leu His Trp Tyr Gln Gln Lys Pro
35 40 45

Gly Lys Ala Pro Lys Leu Leu Ile Tyr Tyr Lys Val Ser Asn Arg
50 55 60

Phe Ser Gly Val Pro Ser Arg Phe Ser Gly Ser Gly Ser Gly Thr
65 70 75

Asp Phe Thr Leu Thr Ile Ser Ser Leu Gln Pro Glu Asp Phe Ala
80 85 90

Thr Tyr Tyr Cys Ser Gln Ser Thr His Val Pro Leu Thr Phe Gly
95 100 105

Gln Gly Thr Lys Val Glu Ile Lys Arg
110 114



(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 109 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 



Asp Ile Gln Met Thr Gln Ser Pro Ser Ser Leu Ser Ala Ser Val
1 5 10 15

Gly Asp Arg Val Thr Ile Thr Cys Arg Ala Ser Lys Thr Ile Ser
20 25 30

Lys Tyr Leu Ala Trp Tyr Gln Gln Lys Pro Gly Lys Ala Pro Lys
35 40 45

Leu Leu Ile Tyr Tyr Ser Gly Ser Thr Leu Glu Ser Gly Val Pro
50 55 60

Ser Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr
65 70 75

Ile Ser Ser Leu Gln Pro Glu Asp Phe Ala Thr Tyr Tyr Cys Gln
80 85 90

Gln His Asn Glu Tyr Pro Leu Thr Phe Gly Gln Gly Thr Lys Val
95 100 105

Glu Ile Lys Arg
109



(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48:

Glu Ile Gln Leu Gln Gln Ser Gly Pro Glu Leu Met Lys Pro Gly
1 5 10 15

Ala Ser Val Lys Ile Ser Cys Lys Ala Ser Gly Tyr Ser Phe Ser
20 25 30

Ser His Tyr Met His Trp Val Lys Gln Ser His Gly Lys Ser Leu
35 40 45

Glu Trp Ile Gly Tyr Ile Asp Pro Ser Asn Gly Glu Thr Thr Tyr
50 55 60

Asn Gln Lys Phe Lys Gly Lys Ala Thr Leu Thr Val Asp Thr Ser
65 70 75

Ser Ser Thr Ala Asn Val His Leu Ser Ser Leu Thr Ser Asp Asp
80 85 90

Ser Ala Val Tyr Phe Cys Ala Ala Arg Gly Asp Tyr Arg Tyr Asn
95 100 105

Gly Asp Trp Phe Phe Asp Val Trp Gly Ala Gly Thr
110 115 117



(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49:

Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly
1 5 10 15

Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Tyr Ser Phe Ser
20 25 30

Ser His Tyr Met His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu
35 40 45

Glu Trp Val Gly Tyr Ile Asp Pro Ser Asn Gly Glu Thr Thr Tyr
50 55 60

Asn Gln Lys Phe Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ser
65 70 75

Lys Asn Thr Leu Tyr Leu Gln Met Asn Ser Leu Arg Ala Glu Asp
80 85 90

Thr Ala Val Tyr Tyr Cys Ala Ala Arg Gly Asp Tyr Arg Tyr Asn
95 100 105

Gly Asp Trp Phe Phe Asp Val Trp Gly Gln Gly Thr
110 115 117



(2) INFORMATION FOR SEQ ID NO: 50: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 116 amino acids

(B) TYPE: Amino Acid
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50:

Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly
1 5 10 15

Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Ser Phe Thr
20 25 30

Gly His Trp Met Asn Trp Val Arg Gln Ala Pro Gly Lys Gly Leu
35 40 45

Glu Trp Val Gly Met Ile His Pro Ser Asp Ser Glu Thr Arg Tyr
50 55 60

Ala Asp Ser Val Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ser
65 70 75

Lys Asn Thr Leu Tyr Leu Gln Met Asn Ser Leu Arg Ala Glu Asp
80 85 90

Thr Ala Val Tyr Tyr Cys Ala Ala Arg Gly Ile Tyr Phe Tyr Gly
95 100 105

Thr Thr Tyr Phe Asp Tyr Trp Gly Gln Gly Thr
110 115 116

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 242 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Met Lys Lys Asn Ile Ala Phe Leu Leu Ala Ser Met Phe Val Phe
1 5 10 15

Ser Ile Ala Thr Asn Ala Tyr Ala Asp Ile Gln Met Thr Gln Ser
20 25 30

Pro Ser Ser Leu Ser Ala Ser Val Gly Asp Arg Val Thr Ile Thr
35 40 45

Cys Arg Ser Ser Gln Ser Leu Val His Gly Ile Gly Asn Thr Tyr
50 55 60

Leu His Trp Tyr Gln Gln Lys Pro Gly Lys Ala Pro Lys Leu Leu
65 70 75

Ile Tyr Lys Val Ser Asn Arg Phe Ser Gly Val Pro Ser Arg Phe
80 85 90

Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr Ile Ser Ser
95 100 105

Leu Gln Pro Glu Asp Phe Ala Thr Tyr Tyr Cys Ser Gln Ser Thr
110 115 120

His Val Pro Leu Thr Phe Gly Gln Gly Thr Lys Val Glu Ile Lys
125 130 135

Arg Thr Val Ala Ala Pro Ser Val Phe Ile Phe Pro Pro Ser Asp
140 145 150

Glu Gln Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn
155 160 165

Asn Phe Tyr Pro Arg Glu Ala Lys Val Gln Trp Lys Val Asp Asn
170 175 180

Ala Leu Gln Ser Gly Asn Ser Gln Glu Ser Val Thr Glu Gln Asp
185 190 195

Ser Lys Asp Ser Thr Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser
200 205 210

Lys Ala Asp Tyr Glu Lys His Lys Val Tyr Ala Cys Glu Val Thr
215 220 225

His Gln Gly Leu Ser Ser Pro Val Thr Lys Ser Phe Asn Arg Gly
230 235 240

Glu Cys
242

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 253 amino acids
(B) TYPE: Amino Acid
(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52:

Met Lys Lys Asn Ile Ala Phe Leu Leu Ala Ser Met Phe Val Phe
1 5 10 15

Ser Ile Ala Thr Asn Ala Tyr Ala Glu Val Gln Leu Val Gln Ser
20 25 30

Gly Gly Gly Leu Val Gln Pro Gly Gly Ser Leu Arg Leu Ser Cys
35 40 45

Ala Ala Ser Gly Tyr Ser Phe Ser Ser His Tyr Met His Trp Val
50 55 60

Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val Gly Tyr Ile Asp
65 70 75

Pro Ser Asn Gly Glu Thr Thr Tyr Asn Gln Lys Phe Lys Gly Arg
80 85 90

Phe Thr Leu Ser Arg Asp Asn Ser Lys Asn Thr Ala Tyr Leu Gln
95 100 105

Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Ala
110 115 120

Arg Gly Asp Tyr Arg Tyr Asn Gly Asp Trp Phe Phe Asp Val Trp
125 130 135

Gly Gln Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly
140 145 150

Pro Ser Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly
155 160 165

Gly Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu
170 175 180

Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val
185 190 195

His Thr Phe Pro Ala Val Leu Gln Ser Ser Gly Leu Tyr Ser Leu
200 205 210

Ser Ser Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gln Thr
215 220 225

Tyr Ile Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp
230 235 240

Lys Lys Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr
245 250 253

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 159 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

Ser Gly Gly Gly Ser Gly Ser Gly Asp Phe Asp Tyr Glu Lys Met 
1 5 10 15 

Ala Asn Ala Asn Lys Gly Ala Met Thr Glu Asn Ala Asp Glu Asn 
20 25 30 

Ala Leu Gln Ser Asp Ala Lys Gly Lys Leu Asp Ser Val Ala Thr
35 40 45

Asp Tyr Gly Ala Ala Ile Asp Gly Phe Ile Gly Asp Val Ser Gly
50 55 60

Leu Ala Asn Gly Asn Gly Ala Thr Gly Asp Phe Ala Gly Ser Ser
65 70 75

Asn Ser Gln Met Ala Gln Val Gly Asp Gly Asp Asn Ser Pro Leu
80 85 90

Met Asn Asn Phe Arg Gln Tyr Leu Pro Ser Leu Pro Gln Ser Val
95 100 105

Glu Cys Arg Pro Phe Val Phe Ser Ala Gly Lys Pro Tyr Glu Phe
110 115 120

Ser Ile Asp Cys Asp Lys Ile Asn Leu Phe Arg Gly Val Phe Ala
125 130 135

Phe Leu Leu Tyr Val Ala Thr Phe Met Tyr Val Phe Ser Thr Phe
140 145 150

Ala Asn Ile Leu Arg Asn Lys Glu Ser
155 159 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 780 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

ATGAAAAAGA ATATCGCATT TCTTCTTGCA TCTATGTTCG TTTTTTCTAT 50 
TGCTACAAAC GCATACGCTG ATATCCAGAT GACCCAGTCC CCGAGCTCCC 100
TGTCCGCCTC TGTGGGCGAT AGGGTCACCA TCACCTGCAG GTCAAGTCAA 150 
AGCTTAGTAC ATGGTATAGG TAACACGTAT TTACACTGGT ATCAACAGAA 200
ACCAGGAAAA GCTCCGAAAC TACTGATTTA CAAAGTATCC AATCGATTCT 250
CTGGAGTCCC TTCTCGCTTC TCTGGATCCG GTTCTGGGAC GGATTTCACT 300
CTGACCATCA GCAGTCTGCA GCCAGAAGAC TTCGCAACTT ATTACTGTTC 350
ACAGAGTACT CATGTCCCGC TCACGTTTGG ACAGGGTACC AAGGTGGAGA 400
TCAAACGAAC TGTGGCTGCA CCATCTGTCT TCATCTTCCC GCCATCTGAT 450 
GAGCAGTTGA AATCTGGAAC TGCTTCTGTT GTGTGCCTGC TGAATAACTT 500 
CTATCCCAGA GAGGCCAAAG TACAGTGGAA GGTGGATAAC GCCCTCCAAT 550 
CGGGTAACTC CCAGGAGAGT GTCACAGAGC AGGACAGCAA GGACAGCACC 600
TACAGCCTCA GCAGCACCCT GACGCTGAGC AAAGCAGACT ACGAGAAACA 650 
CAAAGTCTAC GCCTGCGAAG TCACCCATCA GGGCCTGAGC TCGCCCGTCA 700 
CAAAGAGCTT CAACAGGGGA GAGTGTTAAG CTGATCCTCT ACGCCGGACG 750 
CATCGTGGCC CTAGTACGCA ACTAGTCGTA 780 
(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 253 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Met Lys Lys Asn Ile Ala Phe Leu Leu Ala Ser Met Phe Val Phe
1 5 10 15

Ser Ile Ala Thr Asn Ala Tyr Ala Glu Val Gln Leu Val Glu Ser
20 25 30

Gly Gly Gly Leu Val Gln Pro Gly Gly Ser Leu Arg Leu Ser Cys
35 40 45

Ala Ala Ser Gly Tyr Ser Phe Ser Ser His Tyr Met His Trp Val
50 55 60

Lys Gln Ala Pro Gly Lys Gly Leu Glu Trp Val Gly Tyr Ile Asp
65 70 75

Pro Ser Asn Gly Glu Thr Thr Tyr Asn Gln Lys Phe Lys Gly Arg
80 85 90

Phe Thr Leu Ser Arg Asp Asn Ser Lys Asn Thr Ala Tyr Leu Gln
95 100 105

Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Ala
110 115 120

Arg Gly Asp Tyr Arg Tyr Asn Gly Asp Trp Phe Phe Asp Val Trp
125 130 135

Gly Gln Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly
140 145 150

Pro Ser Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly
155 160 165

Gly Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu
170 175 180

Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val
185 190 195

His Thr Phe Pro Ala Val Leu Gln Ser Ser Gly Leu Tyr Ser Leu
200 205 210

Ser Ser Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gln Thr
215 220 225

Tyr Ile Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp
230 235 240

Lys Lys Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr
245 250 253

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 242 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Met Lys Lys Asn Ile Ala Phe Leu Leu Ala Ser Met Phe Val Phe
1 5 10 15

Ser Ile Ala Thr Asn Ala Tyr Ala Asp Ile Gln Met Thr Gln Ser
20 25 30

Pro Ser Ser Leu Ser Ala Ser Val Gly Asp Arg Val Thr Ile Thr
35 40 45

Cys Arg Ser Ser Gln Ser Leu Val His Gly Ile Gly Ala Thr Tyr
50 55 60

Leu His Trp Tyr Gln Gln Lys Pro Gly Lys Ala Pro Lys Leu Leu
65 70 75

Ile Tyr Lys Val Ser Asn Arg Phe Ser Gly Val Pro Ser Arg Phe
80 85 90

Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr Ile Ser Ser
95 100 105

Leu Gln Pro Glu Asp Phe Ala Thr Tyr Tyr Cys Ser Gln Ser Thr
110 115 120

His Val Pro Leu Thr Phe Gly Gln Gly Thr Lys Val Glu Ile Lys
125 130 135

Arg Thr Val Ala Ala Pro Ser Val Phe Ile Phe Pro Pro Ser Asp
140 145 150

Glu Gln Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn
155 160 165

Asn Phe Tyr Pro Arg Glu Ala Lys Val Gln Trp Lys Val Asp Asn
170 175 180

Ala Leu Gln Ser Gly Asn Ser Gln Glu Ser Val Thr Glu Gln Asp
185 190 195

Ser Lys Asp Ser Thr Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser
200 205 210

Lys Ala Asp Tyr Glu Lys His Lys Val Tyr Ala Cys Glu Val Thr
215 220 225

His Gln Gly Leu Ser Ser Pro Val Thr Lys Ser Phe Asn Arg Gly
230 235 240

Glu Cys 
242 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 amino acids 

(B) TYPE: Amino Acid
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

Cys Pro Pro Cys Pro Ala Pro Glu Leu Leu Gly Gly Arg Met Lys
1 5 10 15

Gln Leu Glu Asp Lys Val Glu Glu Leu Leu Ser Lys Asn Tyr His
20 25 30

Leu Glu Asn Glu Val Ala Arg Leu Lys Lys Leu Val Gly Glu Arg
35 40 45

(2) INFORMATION FOR SEQ ID NO: 58:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 780 base pairs

(B) TYPE: Nucleic Acid

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:58: 

ATGAAAAAGA ATATCGCATT TCTTCTTGCA TCTATGTTCG TTTTTTCTAT 50

TGCTACAAAC GCATACGCTG ATATCCAGAT GACCCAGTCC CCGAGCTCCC 100

TGTCCGCCTC TGTGGGCGAT AGGGTCACCA TCACCTGCAG GTCAAGTCAA 150

AGCTTAGTAC ATGGTATAGG TGCTACGTAT TTACACTGGT ATCAACAGAA 200

ACCAGGAAAA GCTCCGAAAC TACTGATTTA CAAAGTATCC AATCGATTCT 250

CTGGAGTCCC TTCTCGCTTC TCTGGATCCG GTTCTGGGAC GGATTTCACT 300

CTGACCATCA GCAGTCTGCA GCCAGAAGAC TTCGCAACTT ATTACTGTTC 350

ACAGAGTACT CATGTCCCGC TCACGTTTGG ACAGGGTACC AAGGTGGAGA 400

TCAAACGAAC TGTGGCTGCA CCATCTGTCT TCATCTTCCC GCCATCTGAT 450

GAGCAGTTGA AATCTGGAAC TGCTTCTGTT GTGTGCCTGC TGAATAACTT 500

CTATCCCAGA GAGGCCAAAG TACAGTGGAA GGTGGATAAC GCCCTCCAAT 550

CGGGTAACTC CCAGGAGAGT GTCACAGAGC AGGACAGCAA GGACAGCACC 600

TACAGCCTCA GCAGCACCCT GACGCTGAGC AAAGCAGACT ACGAGAAACA 650

CAAAGTCTAC GCCTGCGAAG TCACCCATCA GGGCCTGAGC TCGCCCGTCA 700 

CAAAGAGCTT CAACAGGGGA GAGTGTTAAG CTGATCCTCT ACGCCGGACG 750 

CATCGTGGCC CTAGTACGCA ACTAGTCGTA 780 
(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 927 base pairs

(B) TYPE: Nucleic Acid

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

AAAAGGGTAT CTAGAGGTTG AGGTGATTTT ATGAAAAAGA ATATCGCATT 50 

TCTTCTTGCA TCTATGTTCG TTTTTTCTAT TGCTACAAAC GCGTACGCTG 100 

AGGTTCAGCT AGTGCAGTCT GGCGGTGGCC TGGTGCAGCC AGGGGGCTCA 150 

CTCCGTTTGT CCTGTGCAGC TTCTGGCTAC TCCTTCTCGA GTCACTATAT 200
GCACTGGGTC CGTCAGGCCC CGGGTAAGGG CCTGGAATGG GTTGGATATA 250
TTGATCCTTC CAATGGTGAA ACTACGTATA ATCAAAAGTT CAAGGGCCGT 300
TTCACTTTAT CTCGCGACAA CTCCAAAAAC ACAGCATACC TGCAGATGAA 350
CAGCCTGCGT GCTGAGGACA CTGCCGTCTA TTACTGTGCA AGAGGGGATT 400 
ATCGCTACAA TGGTGACTGG TTCTTCGACG TCTGGGGTCA AGGAACCCTG 450 
GTCACCGTCT CCTCGGCCTC CACCAAGGGC CCATCGGTCT TCCCCCTGGC 500 
ACCCTCCTCC AAGAGCACCT CTGGGGGCAC AGCGGCCCTG GGCTGCCTGG 550 
TCAAGGACTA CTTCCCCGAA CCGGTGACGG TGTCGTGGAA CTCAGGCGCC 600 
CTGACCAGCG GCGTGCACAC CTTCCCGGCT GTCCTACAGT CCTCAGGACT 650 
CTACTCCCTC AGCAGCGTGG TGACCGTGCC CTCCAGCAGC TTGGGCACCC 700 
AGACCTACAT CTGCAACGTG AATCACAAGC CCAGCAACAC CAAGGTCGAC 750 
AAGAAAGTTG AGCCCAAATC TTGTGACAAA ACTCACACAT GCCCGCCGTG 800 
CCCAGCACCA GAACTGCTGG GCGGCCGCAT GAAACAGCTA GAGGACAAGG 850
TCGAAGAGCT ACTCTCCAAG AACTACCACC TAGAGAATGA AGTGGCAAGA 900 
CTCAAAAAGC TTGTCGGGGA GCGCTAA 927
(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 298 amino acids

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:60: 

Met Lys Lys Asn Ile Ala Phe Leu Leu Ala Ser Met Phe Val Phe
1 5 10 15

Ser Ile Ala Thr Asn Ala Tyr Ala Glu Val Gln Leu Val Gln Ser
20 25 30

Gly Gly Gly Leu Val Gln Pro Gly Gly Ser Leu Arg Leu Ser Cys
35 40 45

Ala Ala Ser Gly Tyr Ser Phe Ser Ser His Tyr Met His Trp Val
50 55 60

Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val Gly Tyr Ile Asp
65 70 75

Pro Ser Asn Gly Glu Thr Thr Tyr Asn Gln Lys Phe Lys Gly Arg
80 85 90

Phe Thr Leu Ser Arg Asp Asn Ser Lys Asn Thr Ala Tyr Leu Gln
95 100 105

Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Ala
110 115 120

Arg Gly Asp Tyr Arg Tyr Asn Gly Asp Trp Phe Phe Asp Val Trp
125 130 135

Gly Gln Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly
140 145 150

Pro Ser Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly
155 160 165

Gly Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu
170 175 180

Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val
185 190 195

His Thr Phe Pro Ala Val Leu Gln Ser Ser Gly Leu Tyr Ser Leu
200 205 210

Ser Ser Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gln Thr
215 220 225

Tyr Ile Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp
230 235 240

Lys Lys Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr Cys Pro
245 250 255

Pro Cys Pro Ala Pro Glu Leu Leu Gly Gly Arg Met Lys Gln Leu
260 265 270

Glu Asp Lys Val Glu Glu Leu Leu Ser Lys Asn Tyr His Leu Glu 
275 280 285 

Asn Glu Val Ala Arg Leu Lys Lys Leu Val Gly Glu Arg 
290 295 298 

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6563 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 



GAATTCAACT TCTCCATACT TTGGATAAGG AAATACAGAC ATGAAAAATC 50
TCATTGCTGA GTTGTTATTT AAGCTTGCCC AAAAAGAAGA AGAGTCGAAT 100
GAACTGTGTG CGCAGGTAGA AGCTTTGGAG ATTATCGTCA CTGCAATGCT 150
TCGCAATATG GCGCAAAATG ACCAACAGCG GTTGATTGAT CAGGTAGAGG 200
GGGCGCTGTA CGAGGTAAAG CCCGATGCCA GCATTCCTGA CGACGATACG 250
GAGCTGCTGC GCGATTACGT AAAGAAGTTA TTGAAGCATC CTCGTCAGTA 300
AAAAGTTAAT CTTTTCAACA GCTGTCATAA AGTTGTCACG GCCGAGACTT 350
ATAGTCGCTT TGTTTTTATT TTTTAATGTA TTTGTAACTA GAATTCGAGC 400
TCGGTACCCG GGGATCCTCT CGAGGTTGAG GTGATTTTAT GAAAAAGAAT 450
ATCGCATTTC TTCTTGCATC TATGTTCGTT TTTTCTATTG CTACAAACGC 500
ATACGCTGAT ATCCAGATGA CCCAGTCCCC GAGCTCCCTG TCCGCCTCTG 550
TGGGCGATAG GGTCACCATC ACCTGCAGGT CAAGTCAAAG CTTAGTACAT 600
GGTATAGGTG CTACGTATTT ACACTGGTAT CAACAGAAAC CAGGAAAAGC 650
TCCGAAACTA CTGATTTACA AAGTATCCAA TCGATTCTCT GGAGTCCCTT 700
CTCGCTTCTC TGGATCCGGT TCTGGGACGG ATTTCACTCT GACCATCAGC 750
AGTCTGCAGC CAGAAGACTT CGCAACTTAT TACTGTTCAC AGAGTACTCA 800
TGTCCCGCTC ACGTTTGGAC AGGGTACCAA GGTGGAGATC AAACGAACTG 850
TGGCTGCACC ATCTGTCTTC ATCTTCCCGC CATCTGATGA GCAGTTGAAA 900
TCTGGAACTG CTTCTGTTGT GTGCCTGCTG AATAACTTCT ATCCCAGAGA 950
GGCCAAAGTA CAGTGGAAGG TGGATAACGC CCTCCAATCG GGTAACTCCC 1000
AGGAGAGTGT CACAGAGCAG GACAGCAAGG ACAGCACCTA CAGCCTCAGC 1050
AGCACCCTGA CGCTGAGCAA AGCAGACTAC GAGAAACACA AAGTCTACGC 1100
CTGCGAAGTC ACCCATCAGG GCCTGAGCTC GCCCGTCACA AAGAGCTTCA 1150
ACAGGGGAGA GTGTTAAGCT GATCCTCTAC GCCGGACGCA TCGTGGCCCT 1200
AGTACGCAAC TAGTCGTAAA AAGGGTATCT AGAGGTTGAG GTGATTTTAT 1250
GAAAAAGAAT ATCGCATTTC TTCTTGCATC TATGTTCGTT TTTTCTATTG 1300
CTACAAACGC GTACGCTGAG GTTCAGCTAG TGCAGTCTGG CGGTGGCCTG 1350
GTGCAGCCAG GGGGCTCACT CCGTTTGTCC TGTGCAGCTT CTGGCTACTC 1400
CTTCTCGAGT CACTATATGC ACTGGGTCCG TCAGGCCCCG GGTAAGGGCC 1450
TGGAATGGGT TGGATATATT GATCCTTCCA ATGGTGAAAC TACGTATAAT 1500
CAAAAGTTCA AGGGCCGTTT CACTTTATCT CGCGACAACT CCAAAAACAC 1550
AGCATACCTG CAGATGAACA GCCTGCGTGC TGAGGACACT GCCGTCTATT 1600
ACTGTGCAAG AGGGGATTAT CGCTACAATG GTGACTGGTT CTTCGACGTC 1650
TGGGGTCAAG GAACCCTGGT CACCGTCTCC TCGGCCTCCA CCAAGGGCCC 1700
ATCGGTCTTC CCCCTGGCAC CCTCCTCCAA GAGCACCTCT GGGGGCACAG 1750
CGGCCCTGGG CTGCCTGGTC AAGGACTACT TCCCCGAACC GGTGACGGTG 1800
TCGTGGAACT CAGGCGCCCT GACCAGCGGC GTGCACACCT TCCCGGCTGT 1850
CCTACAGTCC TCAGGACTCT ACTCCCTCAG CAGCGTGGTG ACCGTGCCCT 1900
CCAGCAGCTT GGGCACCCAG ACCTACATCT GCAACGTGAA TCACAAGCCC 1950
AGCAACACCA AGGTCGACAA GAAAGTTGAG CCCAAATCTT GTGACAAAAC 2000
TCACACATGC CCGCCGTGCC CAGCACCAGA ACTGCTGGGC GGCCGCATGA 2050
AACAGCTAGA GGACAAGGTC GAAGAGCTAC TCTCCAAGAA CTACCACCTA 2100
GAGAATGAAG TGGCAAGACT CAAAAAGCTT GTCGGGGAGC GCTAAGCATG 2150
CGACGGCCCT AGAGTCCCTA ACGCTCGGTT GCCGCCGGGC GTTTTTTATT 2200
GTTAACTCAT GTTTGACAGC TTATCATCGA TAAGCTTTAA TGCGGTAGTT 2250
TATCACAGTT AAATTGCTAA CGCAGTCAGG CACCGTGTAT GAAATCTAAC 2300
AATGCGCTCA TCGTCATCCT CGGCACCGTC ACCCTGGATG CTGTAGGCAT 2350
AGGCTTGGTT ATGCCGGTAC TGCCGGGCCT CTTGCGGGAT ATCGTCCATT 2400
CCGACAGCAT CGCCAGTCAC TATGGCGTGC TGCTAGCGCT ATATGCGTTG 2450
ATGCAATTTC TATGCGCACC CGTTCTCGGA GCACTGTCCG ACCGCTTTGG 2500
CCGCCGCCCA GTCCTGCTCG CTTCGCTACT TGGAGCCACT ATCGACTACG 2550
CGATCATGGC GACCACACCC GTCCTGTGGA TCCTCTACGC CGGACGCATC 2600
GTGGCCGGCA TCACCGGCGC CACAGGTGCG GTTGCTGGCG CCTATATCGC 2650
CGACATCACC GATGGGGAAG ATCGGGCTCG CCACTTCGGG CTCATGAGCG 2700
CTTGTTTCGG CGTGGGTATG GTGGCAGGCC CCGTGGCCGG GGGACTGTTG 2750
GGCGCCATCT CCTTGCACGC ACCATTCCTT GCGGCGGCGG TGCTCAACGG 2800
CCTCAACCTA CTACTGGGCT GCTTCCTAAT GCAGGAGTCG CATAAGGGAG 2850
AGCGTCGTCC GATGCCCTTG AGAGCCTTCA ACCCAGTCAG CTCCTTCCGG 2900
TGGGCGCGGG GCATGACTAT CGTCGCCGCA CTTATGACTG TCTTCTTTAT 2950
CATGCAACTC GTAGGACAGG TGCCGGCAGC GCTCTGGGTC ATTTTCGGCG 3000
AGGACCGCTT TCGCTGGAGC GCGACGATGA TCGGCCTGTC GCTTGCGGTA 3050
TTCGGAATCT TGCACGCCCT CGCTCAAGCC TTCGTCACTG GTCCCGCCAC 3100
CAAACGTTTC GGCGAGAAGC AGGCCATTAT CGCCGGCATG GCGGCCGACG 3150
CGCTGGGCTA CGTCTTGCTG GCGTTCGCGA CGCGAGGCTG GATGGCCTTC 3200
CCCATTATGA TTCTTCTCGC TTCCGGCGGC ATCGGGATGC CCGCGTTGCA 3250
GGCCATGCTG TCCAGGCAGG TAGATGACGA CCATCAGGGA CAGCTTCAAG 3300
GATCGCTCGC GGCTCTTACC AGCCTAACTT CGATCACTGG ACCGCTGATC 3350
GTCACGGCGA TTTATGCCGC CTCGGCGAGC ACATGGAACG GGTTGGCATG 3400
GATTGTAGGC GCCGCCCTAT ACCTTGTCTG CCTCCCCGCG TTGCGTCGCG 3450
GTGCATGGAG CCGGGCCACC TCGACCTGAA TGGAAGCCGG CGGCACCTCG 3500
CTAACGGATT CACCACTCCA AGAATTGGAG CCAATCAATT CTTGCGGAGA 3550
ACTGTGAATG CGCAAACCAA CCCTTGGCAG AACATATCCA TCGCGTCCGC 3600
CATCTCCAGC AGCCGCACGC GGCGCATCTC GGGCAGCGTT GGGTCCTGGC 3650
CACGGGTGCG CATGATCGTG CTCCTGTCGT TGAGGACCCG GCTAGGCTGG 3700
CGGGGTTGCC TTACTGGTTA GCAGAATGAA TCACCGATAC GCGAGCGAAC 3750
GTGAAGCGAC TGCTGCTGCA AAACGTCTGC GACCTGAGCA ACAACATGAA 3800
TGGTCTTCGG TTTCCGTGTT TCGTAAAGTC TGGAAACGCG GAAGTCAGCG 3850
CCCTGCACCA TTATGTTCCG GATCTGCATC GCAGGATGCT GCTGGCTACC 3900
CTGTGGAACA CCTACATCTG TATTAACGAA GCGCTGGCAT TGACCCTGAG 3950
TGATTTTTCT CTGGTCCCGC CGCATCCATA CCGCCAGTTG TTTACCCTCA 4000
CAACGTTCCA GTAACCGGGC ATGTTCATCA TCAGTAACCC GTATCGTGAG 4050
CATCCTCTCT CGTTTCATCG GTATCATTAC CCCCATGAAC AGAAATTCCC 4100
CCTTACACGG AGGCATCAAG TGACCAAACA GGAAAAAACC GCCCTTAACA 4150
TGGCCCGCTT TATCAGAAGC CAGACATTAA CGCTTCTGGA GAAACTCAAC 4200
GAGCTGGACG CGGATGAACA GGCAGACATC TGTGAATCGC TTCACGACCA 4250
CGCTGATGAG CTTTACCGCA GCTGCCTCGC GCGTTTCGGT GATGACGGTG 4300
AAAACCTCTG ACACATGCAG CTCCCGGAGA CGGTCACAGC TTGTCTGTAA 4350
GCGGATGCCG GGAGCAGACA AGCCCGTCAG GGCGCGTCAG CGGGTGTTGG 4400
CGGGTGTCGG GGCGCAGCCA TGACCCAGTC ACGTAGCGAT AGCGGAGTGT 4450
ATACTGGCTT AACTATGCGG CATCAGAGCA GATTGTACTG AGAGTGCACC 4500
ATATGCGGTG TGAAATACCG CACAGATGCG TAAGGAGAAA ATACCGCATC 4550
AGGCGCTCTT CCGCTTCCTC GCTCACTGAC TCGCTGCGCT CGGTCGTTCG 4600
GCTGCGGCGA GCGGTATCAG CTCACTCAAA GGCGGTAATA CGGTTATCCA 4650
CAGAATCAGG GGATAACGCA GGAAAGAACA TGTGAGCAAA AGGCCAGCAA 4700
AAGGCCAGGA ACCGTAAAAA GGCCGCGTTG CTGGCGTTTT TCCATAGGCT 4750
CCGCCCCCCT GACGAGCATC ACAAAAATCG ACGCTCAAGT CAGAGGTGGC 4800
GAAACCCGAC AGGACTATAA AGATACCAGG CGTTTCCCCC TGGAAGCTCC 4850
CTCGTGCGCT CTCCTGTTCC GACCCTGCCG CTTACCGGAT ACCTGTCCGC 4900
CTTTCTCCCT TCGGGAAGCG TGGCGCTTTC TCATAGCTCA CGCTGTAGGT 4950
ATCTCAGTTC GGTGTAGGTC GTTCGCTCCA AGCTGGGCTG TGTGCACGAA 5000
CCCCCCGTTC AGCCCGACCG CTGCGCCTTA TCCGGTAACT ATCGTCTTGA 5050

GTCCAACCCG GTAAGACACG ACTTATCGCC ACTGGCAGCA GCCACTGGTA 5100
ACAGGATTAG CAGAGCGAGG TATGTAGGCG GTGCTACAGA GTTCTTGAAG 5150
TGGTGGCCTA ACTACGGCTA CACTAGAAGG ACAGTATTTG GTATCTGCGC 5200
TCTGCTGAAG CCAGTTACCT TCGGAAAAAG AGTTGGTAGC TCTTGATCCG 5250
GCAAACAAAC CACCGCTGGT AGCGGTGGTT TTTTTGTTTG CAAGCAGCAG 5300
ATTACGCGCA GAAAAAAAGG ATCTCAAGAA GATCCTTTGA TCTTTTCTAC 5350
GGGGTCTGAC GCTCAGTGGA ACGAAAACTC ACGTTAAGGG ATTTTGGTCA 5400
TGAGATTATC AAAAAGGATC TTCACCTAGA TCCTTTTAAA TTAAAAATGA 5450
AGTTTTAAAT CAATCTAAAG TATATATGAG TAAACTTGGT CTGACAGTTA 5500
CCAATGCTTA ATCAGTGAGG CACCTATCTC AGCGATCTGT CTATTTCGTT 5550
CATCCATAGT TGCCTGACTC CCCGTCGTGT AGATAACTAC GATACGGGAG 5600
GGCTTACCAT CTGGCCCCAG TGCTGCAATG ATACCGCGAG ACCCACGCTC 5650
ACCGGCTCCA GATTTATCAG CAATAAACCA GCCAGCCGGA AGGGCCGAGC 5700
GCAGAAGTGG TCCTGCAACT TTATCCGCCT CCATCCAGTC TATTAATTGT 5750
TGCCGGGAAG CTAGAGTAAG TAGTTCGCCA GTTAATAGTT TGCGCAACGT 5800
TGTTGCCATT GCTGCAGGCA TCGTGGTGTC ACGCTCGTCG TTTGGTATGG 5850
CTTCATTCAG CTCCGGTTCC CAACGATCAA GGCGAGTTAC ATGATCCCCC 5900
ATGTTGTGCA AAAAAGCGGT TAGCTCCTTC GGTCCTCCGA TCGTTGTCAG 5950
AAGTAAGTTG GCCGCAGTGT TATCACTCAT GGTTATGGCA GCACTGCATA 6000
ATTCTCTTAC TGTCATGCCA TCCGTAAGAT GCTTTTCTGT GACTGGTGAG 6050
TACTCAACCA AGTCATTCTG AGAATAGTGT ATGCGGCGAC CGAGTTGCTC 6100
TTGCCCGGCG TCAACACGGG ATAATACCGC GCCACATAGC AGAACTTTAA 6150
AAGTGCTCAT CATTGGAAAA CGTTCTTCGG GGCGAAAACT CTCAAGGATC 6200
TTACCGCTGT TGAGATCCAG TTCGATGTAA CCCACTCGTG CACCCAACTG 6250
ATCTTCAGCA TCTTTTACTT TCACCAGCGT TTCTGGGTGA GCAAAAACAG 6300
GAAGGCAAAA TGCCGCAAAA AAGGGAATAA GGGCGACACG GAAATGTTGA 6350
ATACTCATAC TCTTCCTTTT TCAATATTAT TGAAGCATTT ATCAGGGTTA 6400
TTGTCTCATG AGCGGATACA TATTTGAATG TATTTAGAAA AATAAACAAA 6450
TAGGGGTTCC GCGCACATTT CCCCGAAAAG TGCCACCTGA CGTCTAAGAA 6500
ACCATTATTA TCATGACATT AACCTATAAA AATAGGCGTA TCACGAGGCC 6550
CTTTCGTCTT CAA 6563
(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 242 amino acids

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:62: 

Met Lys Lys Asn Ile Ala Phe Leu Leu Ala Ser Met Phe Val Phe
1 5 10 15

Ser Ile Ala Thr Asn Ala Tyr Ala Asp Ile Gln Met Thr Gln Ser
20 25 30

Pro Ser Ser Leu Ser Ala Ser Val Gly Asp Arg Val Thr Ile Thr
35 40 45

Cys Arg Ser Ser Gln Ser Leu Val His Gly Ile Gly Glu Thr Tyr
50 55 60

Leu His Trp Tyr Gln Gln Lys Pro Gly Lys Ala Pro Lys Leu Leu
65 70 75

Ile Tyr Lys Val Ser Asn Arg Phe Ser Gly Val Pro Ser Arg Phe
80 85 90

Ser Gly Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr Ile Ser Ser
95 100 105

Leu Gln Pro Glu Asp Phe Ala Thr Tyr Tyr Cys Ser Gln Ser Thr
110 115 120

His Val Pro Leu Thr Phe Gly Gln Gly Thr Lys Val Glu Ile Lys
125 130 135

Arg Thr Val Ala Ala Pro Ser Val Phe Ile Phe Pro Pro Ser Asp
140 145 150

Glu Gln Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn
155 160 165

Asn Phe Tyr Pro Arg Glu Ala Lys Val Gln Trp Lys Val Asp Asn
170 175 180

Ala Leu Gln Ser Gly Asn Ser Gln Glu Ser Val Thr Glu Gln Asp
185 190 195

Ser Lys Asp Ser Thr Tyr Ser Leu Ser Ser Thr Leu Thr Leu Ser
200 205 210

Lys Ala Asp Tyr Glu Lys His Lys Val Tyr Ala Cys Glu Val Thr
215 220 225

His Gln Gly Leu Ser Ser Pro Val Thr Lys Ser Phe Asn Arg Gly
230 235 240

Glu Cys 
242 

(2) INFORMATION FOR SEQ ID NO: 63:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:63: 



CATGGTATAG GTTAAACTTA TTTACAC 27 
(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:64: 



CATGGTATAG GTNNSACTTA TTTACAC 27
(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 780 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 



ATGAAAAAGA ATATCGCATT TCTTCTTGCA TCTATGTTCG TTTTTTCTAT 50 
TGCTACAAAC GCATACGCTG ATATCCAGAT GACCCAGTCC CCGAGCTCCC 100 
TGTCCGCCTC TGTGGGCGAT AGGGTCACCA TCACCTGCAG GTCAAGTCAA 150 
AGCTTAGTAC ATGGTATAGG TGAGACGTAT TTACACTGGT ATCAACAGAA 200 
ACCAGGAAAA GCTCCGAAAC TACTGATTTA CAAAGTATCC AATCGATTCT 250 
CTGGAGTCCC TTCTCGCTTC TCTGGATCCG GTTCTGGGAC GGATTTCACT 300 
CTGACCATCA GCAGTCTGCA GCCAGAAGAC TTCGCAACTT ATTACTGTTC 350 
ACAGAGTACT CATGTCCCGC TCACGTTTGG ACAGGGTACC AAGGTGGAGA 400
TCAAACGAAC TGTGGCTGCA CCATCTGTCT TCATCTTCCC GCCATCTGAT 450
GAGCAGTTGA AATCTGGAAC TGCTTCTGTT GTGTGCCTGC TGAATAACTT 500 
CTATCCCAGA GAGGCCAAAG TACAGTGGAA GGTGGATAAC GCCCTCCAAT 550 
CGGGTAACTC CCAGGAGAGT GTCACAGAGC AGGACAGCAA GGACAGCACC 600 
TACAGCCTCA GCAGCACCCT GACGCTGAGC AAAGCAGACT ACGAGAAACA 650

CAAAGTCTAC GCCTGCGAAG TCACCCATCA GGGCCTGAGC TCGCCCGTCA 700

CAAAGAGCTT CAACAGGGGA GAGTGTTAAG CTGATCCTCT ACGCCGGACG 750 

CATCGTGGCC CTAGTACGCA ACTAGTCGTA 780 
(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

CTAGTGCAGT CTGGCGGTGG CCTGGTGCAG CCAGGGGGCT CACTCCGTTT 50 
GTCCTGTGCA GCTTCTGGCT ACTCCTTC 78
(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

TCGAGAAGGA GTAGCCAGAA GCTGCACAGG ACAAACGGAG TGAGCCCCCT 50
GGCTGCACCA GGCCACCGCC AGACTGCACT AG 82 
(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8120 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:68: 



TTCGAGCTCG CCCGACATTG ATTATTGACT AGAGTCGATC GACAGCTGTG 50
GAATGTGTGT CAGTTAGGGT GTGGAAAGTC CCCAGGCTCC CCAGCAGGCA 100
GAAGTATGCA AAGCATGCAT CTCAATTAGT CAGCAACCAG GTGTGGAAAG 150
TCCCCAGGCT CCCCAGCAGG CAGAAGTATG CAAAGCATGC ATCTCAATTA 200
GTCAGCAACC ATAGTCCCGC CCCTAACTCC GCCCATCCCG CCCCTAACTC 250
CGCCCAGTTC CGCCCATTCT CCGCCCCATG GCTGACTAAT TTTTTTTATT 300
TATGCAGAGG CCGAGGCCGC CTCGGCCTCT GAGCTATTCC AGAAGTAGTG 350
AGGAGGCTTT TTTGGAGGCC TAGGCTTTTG CAAAAAGCTA GCTTATCCGG 400
CCGGGAACGG TGCATTGGAA CGCGGATTCC CCGTGCCAAG AGTGACGTAA 450
GTACCGCCTA TAGAGCGATA AGAGGATTTT ATCCCCGCTG CCATCATGGT 500
TCGACCATTG AACTGCATCG TCGCCGTGTC CCAAAATATG GGGATTGGCA 550
AGAACGGAGA CCTACCCTGG CCTCCGCTCA GGAACGAGTT CAAGTACTTC 600
CAAAGAATGA CCACAACCTC TTCAGTGGAA GGTAAACAGA ATCTGGTGAT 650
TATGGGTAGG AAAACCTGGT TCTCCATTCC TGAGAAGAAT CGACCTTTAA 700
AGGACAGAAT TAATATAGTT CTCAGTAGAG AACTCAAAGA ACCACCACGA 750
GGAGCTCATT TTCTTGCCAA AAGTTTGGAT GATGCCTTAA GACTTATTGA 800
ACAACCGGAA TTGGCAAGTA AAGTAGACAT GGTTTGGATA GTCGGAGGCA 850
GTTCTGTTTA CCAGGAAGCC ATGAATCAAC CAGGCCACCT TAGACTCTTT 900
GTGACAAGGA TCATGCAGGA ATTTGAAAGT GACACGTTTT TCCCAGAAAT 950
TGATTTGGGG AAATATAAAC CTCTCCCAGA ATACCCAGGC GTCCTCTCTG 1000
AGGTCCAGGA GGAAAAAGGC ATCAAGTATA AGTTTGAAGT CTACGAGAAG 1050
AAAGACTAAC AGGAAGATGC TTTCAAGTTC TCTGCTCCCC TCCTAAAGCT 1100
ATGCATTTTT ATAAGACCAT GGGACTTTTG CTGGCTTTAG ATCCCCTTGG 1150
CTTCGTTAGA ACGCAGCTAC AATTAATACA TAACCTTATG TATCATACAC 1200
ATACGATTTA GGTGACACTA TAGATAACAT CCACTTTGCC TTTCTCTCCA 1250
CAGGTGTCCA CTCCCAGGTC CAACTGCACC TCGGTTCTAT CGATTGAATT 1300
CCACCATGGG ATGGTCATGT ATCATCCTTT TTCTAGTAGC AACTGCAACT 1350
GGAGTACATT CAGAAGTTCA GCTAGTGCAG TCTGGCGGTG GCCTGGTGCA 1400
GCCAGGGGGC TCACTCCGTT TGTCCTGTGC AGCTTCTGGC TACTCCTTCT 1450
CGAGTCACTA TATGCACTGG GTCCGTCAGG CCCCGGGTAA GGGCCTGGAA 1500
TGGGTTGGAT ATATTGATCC TTCCAATGGT GAAACTACGT ATAATCAAAA 1550
GTTCAAGGGC CGTTTCACTT TATCTCGCGA CAACTCCAAA AACACAGCAT 1600
ACCTGCAGAT GAACAGCCTG CGTGCTGAGG ACACTGCCGT CTATTACTGT 1650
GCAAGAGGGG ATTATCGCTA CAATGGTGAC TGGTTCTTCG ACGTCTGGGG 1700
TCAAGGAACC CTGGTCACCG TCTCCTCGGC CTCCACCAAG GGCCCATCGG 1750
TCTTCCCCCT GGCACCCTCC TCCAAGAGCA CCTCTGGGGG CACAGCGGCC 1800
CTGGGCTGCC TGGTCAAGGA CTACTTCCCC GAACCGGTGA CGGTGTCGTG 1850
GAACTCAGGC GCCCTGACCA GCGGCGTGCA CACCTTCCCG GCTGTCCTAC 1900
AGTCCTCAGG ACTCTACTCC CTCAGCAGCG TGGTGACTGT GCCCTCTAGC 1950

AGCTTGGGCA CCCAGACCTA CATCTGCAAC GTGAATCACA AGCCCAGCAA 2000
CACCAAGGTG GACAAGAAAG TTGAGCCCAA ATCTTGTGAC AAAACTCACA 2050
CATGCCCACC GTGCCCAGCA CCTGAACTCC TGGGGGGACC GTCAGTCTTC 2100
CTCTTCCCCC CAAAACCCAA GGACACCCTC ATGATCTCCC GGACCCCTGA 2150
GGTCACATGC GTGGTGGTGG ACGTGAGCCA CGAAGACCCT GAGGTCAAGT 2200
TCAACTGGTA CGTGGACGGC GTGGAGGTGC ATAATGCCAA GACAAAGCCG 2250
CGGGAGGAGC AGTACAACAG CACGTACCGT GTGGTCAGCG TCCTCACCGT 2300
CCTGCACCAG GACTGGCTGA ATGGCAAGGA GTACAAGTGC AAGGTCTCCA 2350
ACAAAGCCCT CCCAGCCCCC ATCGAGAAAA CCATCTCCAA AGCCAAAGGG 2400
CAGCCCCGAG AACCACAGGT GTACACCCTG CCCCCATCCC GGGAAGAGAT 2450
GACCAAGAAC CAGGTCAGCC TGACCTGCCT GGTCAAAGGC TTCTATCCCA 2500
GCGACATCGC CGTGGAGTGG GAGAGCAATG GGCAGCCGGA GAACAACTAC 2550
AAGACCACGC CTCCCGTGCT GGACTCCGAC GGCTCCTTCT TCCTCTACAG 2600
CAAGCTCACC GTGGACAAGA GCAGGTGGCA GCAGGGGAAC GTCTTCTCAT 2650
GCTCCGTGAT GCATGAGGCT CTGCACAACC ACTACACGCA GAAGAGCCTC 2700
TCCCTGTCTC CGGGTAAATG AGTGCGACGG CCCTAGAGTC GACCTGCAGA 2750
AGCTTGGCCG CCATGGCCCA ACTTGTTTAT TGCAGCTTAT AATGGTTACA 2800
AATAAAGCAA TAGCATCACA AATTTCACAA ATAAAGCATT TTTTTCACTG 2850
CATTCTAGTT GTGGTTTGTC CAAACTCATC AATGTATCTT ATCATGTCTG 2900
GATCGATCGG GAATTAATTC GGCGCAGCAC CATGGCCTGA AATAACCTCT 2950
GAAAGAGGAA CTTGGTTAGG TACCTTCTGA GGCGGAAAGA ACCATCTGTG 3000
GAATGTGTGT CAGTTAGGGT GTGGAAAGTC CCCAGGCTCC CCAGCAGGCA 3050
GAAGTATGCA AAGCATGCAT CTCAATTAGT CAGCAACCAG GTGTGGAAAG 3100
TCCCCAGGCT CCCCAGCAGG CAGAAGTATG CAAAGCATGC ATCTCAATTA 3150
GTCAGCAACC ATAGTCCCGC CCCTAACTCC GCCCATCCCG CCCCTAACTC 3200
CGCCCAGTTC CGCCCATTCT CCGCCCCATG GCTGACTAAT TTTTTTTATT 3250
TATGCAGAGG CCGAGGCCGC CTCGGCCTCT GAGCTATTCC AGAAGTAGTG 3300
AGGAGGCTTT TTTGGAGGCC TAGGCTTTTG CAAAAAGCTA GCTTATCCGG 3350
CCGGGAACGG TGCATTGGAA CGCGGATTCC CCGTGCCAAG AGTCAGGTAA 3400
GTACCGCCTA TAGAGTCTAT AGGCCCACCC CCTTGGCTTC GTTAGAACGC 3450
GGCTACAATT AATACATAAC CTTTTGGATC GATCCTACTG ACACTGACAT 3500
CCACTTTTTC TTTTTCTCCA CAGGTGTCCA CTCCCAGGTC CAACTGCACC 3550
TCGGTTCGCG AAGCTAGCTT GGGCTGCATC GATTGAATTC CACCATGGGA 3600
TGGTCATGTA TCATCCTTTT TCTAGTAGCA ACTGCAACTG GAGTACATTC 3650
AGATATCCAG ATGACCCAGT CCCCGAGCTC CCTGTCCGCC TCTGTGGGCG 3700
ATAGGGTCAC CATCACCTGC AGGTCAAGTC AAAGCTTAGT ACATGGTATA 3750
GGTGCTACGT ATTTACACTG GTATCAACAG AAACCAGGAA AAGCTCCGAA 3800
ACTACTGATT TACAAAGTAT CCAATCGATT CTCTGGAGTC CCTTCTCGCT 3850
TCTCTGGATC CGGTTCTGGG ACGGATTTCA CTCTGACCAT CAGCAGTCTG 3900
CAGCCAGAAG ACTTCGCAAC TTATTACTGT TCACAGAGTA CTCATGTCCC 3950
GCTCACGTTT GGACAGGGTA CCAAGGTGGA GATCAAACGA ACTGTGGCTG 4000
CACCATCTGT CTTCATCTTC CCGCCATCTG ATGAGCAGTT GAAATCTGGA 4050
ACTGCTTCTG TTGTGTGCCT GCTGAATAAC TTCTATCCCA GAGAGGCCAA 4100
AGTACAGTGG AAGGTGGATA ACGCCCTCCA ATCGGGTAAC TCCCAGGAGA 4150
GTGTCACAGA GCAGGACAGC AAGGACAGCA CCTACAGCCT CAGCAGCACC 4200
CTGACGCTGA GCAAAGCAGA CTACGAGAAA CACAAAGTCT ACGCCTGCGA 4250
AGTCACCCAT CAGGGCCTGA GCTCGCCCGT CACAAAGAGC TTCAACAGGG 4300
GAGAGTGTTA AGCTTGGCCG CCATGGCCCA ACTTGTTTAT TGCAGCTTAT 4350
AATGGTTACA AATAAAGCAA TAGCATCACA AATTTCACAA ATAAAGCATT 4400
TTTTTCACTG CATTCTAGTT GTGGTTTGTC CAAACTCATC AATGTATCTT 4450
ATCATGTCTG GATCGATCGG GAATTAATTC GGCGCAGCAC CATGGCCTGA 4500
AATAACCTCT GAAAGAGGAA CTTGGTTAGG TACCTTCTGA GGCGGAAAGA 4550
ACCAGCTGTG GAATGTGTGT CAGTTAGGGT GTGGAAAGTC CCCAGGCTCC 4600
CCAGCAGGCA GAAGTATGCA AAGCATGCAT CTCAATTAGT CAGCAACCAG 4650
GTGTGGAAAG TCCCCAGGCT CCCCAGCAGG CAGAAGTATG CAAAGCATGC 4700
ATCTCAATTA GTCAGCAACC ATAGTCCCGC CCCTAACTCC GCCCATCCCG 4750
CCCCTAACTC CGCCCAGTTG CGCCCATTCT CCGCCCCATG GCTGACTAAT 4800
TTTTTTTATT TATGCAGAGG CCGAGGCCGC CTCGGCCTCT GAGCTATTCC 4850
AGAAGTAGTG AGGAGGCTTT TTTGGAGGCC TAGGCTTTTG CAAAAAGCTG 4900
TTACCTCGAG CGGCCGCTTA ATTAAGGCGC GCCATTTAAA TCCTGCAGGT 4950
AACAGCTTGG CACTGGCCGT CGTTTTACAA CGTCGTGACT GGGAAAACCC 5000
TGGCGTTACC CAACTTAATC GCCTTGCAGC ACATCCCCCC TTCGCCAGCT 5050
GGCGTAATAG CGAAGAGGCC CGCACCGATC GCCCTTCCCA ACAGTTGCGT 5100
AGCCTGAATG GCGAATGGCG CCTGATGCGG TATTTTCTCC TTACGCATCT 5150
GTGCGGTATT TCACACCGCA TACGTCAAAG CAACCATAGT ACGCGCCCTG 5200
TAGCGGCGCA TTAAGCGCGG CGGGTGTGGT GGTTACGCGC AGCGTGACCG 5250
CTACACTTGC CAGCGCCCTA GCGCCCGCTC CTTTCGCTTT CTTCCCTTCC 5300
TTTCTCGCCA CGTTCGCCGG CTTTCCCCGT CAAGCTCTAA ATCGGGGGCT 5350
CCCTTTAGGG TTCCGATTTA GTGCTTTACG GCACCTCGAC CCCAAAAAAC 5400
TTGATTTGGG TGATGGTTCA CGTAGTGGGC CATCGCCCTG ATAGACGGTT 5450
TTTCGCCCTT TGACGTTGGA GTCCACGTTC TTTAATAGTG GACTCTTGTT 5500
CCAAACTGGA ACAACACTCA ACCCTATCTC GGGCTATTCT TTTGATTTAT 5550
AAGGGATTTT GCCGATTTCG GCCTATTGGT TAAAAAATGA GCTGATTTAA 5600
CAAAAATTTA ACGCGAATTT TAACAAAATA TTAACGTTTA CAATTTTATG 5650
GTGCACTCTC AGTACAATCT GCTCTGATGC CGCATAGTTA AGCCAACTCC 5700
GCTATCGCTA CGTGACTGGG TCATGGCTGC GCCCCGACAC CCGCCAACAC 5750
CCGCTGACGC GCCCTGACGG GCTTGTCTGC TCCCGGCATC CGCTTACAGA 5800
CAAGCTGTGA CCGTCTCCGG GAGCTGCATG TGTCAGAGGT TTTCACCGTC 5850
ATCACCGAAA CGCGCGAGGC AGTATTCTTG AAGACGAAAG GGCCTCGTGA 5900
TACGCCTATT TTTATAGGTT AATGTCATGA TAATAATGGT TTCTTAGACG 5950
TCAGGTGGCA CTTTTCGGGG AAATGTGCGC GGAACCCCTA TTTGTTTATT 6000
TTTCTAAATA CATTCAAATA TGTATCCGCT CATGAGACAA TAACCCTGAT 6050
AAATGCTTCA ATAATATTGA AAAAGGAAGA GTATGAGTAT TCAACATTTC 6100
CGTGTCGCCC TTATTCCCTT TTTTGCGGCA TTTTGCCTTC CTGTTTTTGC 6150
TCACCCAGAA ACGCTGGTGA AAGTAAAAGA TGCTGAAGAT CAGTTGGGTG 6200
CACGAGTGGG TTACATCGAA CTGGATCTCA ACAGCGGTAA GATCCTTGAG 6250
AGTTTTCGCC CCGAAGAACG TTTTCCAATG ATGAGCACTT TTAAAGTTCT 6300
GCTATGTGGC GCGGTATTAT CCCGTGATGA CGCCGGGCAA GAGCAACTCG 6350
GTCGCCGCAT ACACTATTCT CAGAATGACT TGGTTGAGTA CTCACCAGTC 6400
ACAGAAAAGC ATCTTACGGA TGGCATGACA GTAAGAGAAT TATGCAGTGC 6450
TGCCATAACC ATGAGTGATA ACACTGCGGC CAACTTACTT CTGACAACGA 6500
TCGGAGGACC GAAGGAGCTA ACCGCTTTTT TGCACAACAT GGGGGATCAT 6550
GTAACTCGCC TTGATCGTTG GGAACCGGAG CTGAATGAAG CCATACCAAA 6600
CGACGAGCGT G AC AC C ACG A TGCCAGCAGC AATGGCAACA ACGTTGCGCA 6650 

AACTATTAAC TGGCGAACTA CTTACTCTAG CTTCCCGGCA ACAATTAATA 6700
GACTGGATGG AGGCGGATAA AGTTGCAGGA CCACTTCTGC GCTCGGCCCT 6750
TCCGGCTGGC TGGTTTATTG CTGATAAATC TGGAGCCGGT GAGCGTGGGT 6800
CTCGCGGTAT CATTGCAGCA CTGGGGCCAG ATGGTAAGCC CTCCCGTATC 6850
GTAGTTATCT ACACGACGGG GAGTCAGGCA ACTATGGATG AACGAAATAG 6900
ACAGATCGCT GAGATAGGTG CCTCACTGAT TAAGCATTGG TAACTGTCAG 6950
ACCAAGTTTA CTCATATATA CTTTAGATTG ATTTAAAACT TCATTTTTAA 7000
TTTAAAAGGA TCTAGGTGAA GATCCTTTTT GATAATCTCA TGACCAAAAT 7050
CCCTTAACGT GAGTTTTCGT TCCACTGAGC GTCAGACCCC GTAGAAAAGA 7100
TCAAAGGATC TTCTTGAGAT CCTTTTTTTC TGCGCGTAAT CTGCTGCTTG 7150
CAAACAAAAA AACCACCGCT ACCAGCGGTG GTTTGTTTGC CGGATCAAGA 7200
GCTACCAACT CTTTTTCCGA AGGTAACTGG CTTCAGCAGA GCGCAGATAC 7250
CAAATACTGT CCTTCTAGTG TAGCCGTAGT TAGGCCACCA CTTCAAGAAC 7300
TCTGTAGCAC CGCCTACATA CCTCGCTCTG CTAATCCTGT TACCAGTGGC 7350
TGCTGCCAGT GGCGATAAGT CGTGTCTTAC CGGGTTGGAC TCAAGACGAT 7400
AGTTACCGGA TAAGGCGCAG CGGTCGGGCT GAACGGGGGG TTCGTGCACA 7450
CAGCCCAGCT TGGAGCGAAC GACCTACACC GAACTGAGAT ACCTACAGCG 7500
TGAGCATTGA GAAAGCGCCA CGCTTCCCGA AGGGAGAAAG GCGGACAGGT 7550
ATCCGGTAAG CGGCAGGGTC GGAACAGGAG AGCGCACGAG GGAGCTTCCA 7600
GGGGGAAACG CCTGGTATCT TTATAGTCCT GTCGGGTTTC GCCACCTCTG 7650
ACTTGAGCGT CGATTTTTGT GATGCTCGTC AGGGGGGCGG AGCCTATGGA 7700
AAAACGCCAG CAACGCGGCC TTTTTACGGT TCCTGGCCTT TTGCTGGCCT 7750
TTTGCTCACA TGTTCTTTCC TGCGTTATCC CCTGATTCTG TGGATAACCG 7800
TATTACCGCC TTTGAGTGAG CTGATACCGC TCGCCGCAGC CGAACGACCG 7850
AGCGCAGCGA GTCAGTGAGC GAGGAAGCGG AAGAGCGCCC AATACGCAAA 7900
CCGCCTCTCC CCGCGCGTTG GCCGATTCAT TAATCCAGCT GGCACGACAG 7950
GTTTCCCGAC TGGAAAGCGG GCAGTGAGCG CAACGCAATT AATGTGAGTT 8000
ACCTCACTCA TTAGGCACCC CAGGCTTTAC ACTTTATGCT TCCGGCTCGT 8050
ATGTTGTGTG GAATTGTGAG CGGATAACAA TTTCACACAG GAAACAGCTA 8100
TGACCATGAT TACGAATTAA 8120
(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 800 base pairs 

(B) TYPE: Nucleic Acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

AAAAGGGTAT CTAGAGGTTG AGGTGATTTT ATGAAAAAGA ATATCGCATT 50
TCTTCTTGCA TCTATGTTCG TTTTTTCTAT TGCTACAAAC GCGTACGCTG 100
AGGTTCAGCT AGTGCAGTCT GGCGGTGGCC TGGTGCAGCC AGGGGGCTCA 150
CTCCGTTTGT CCTGTGCAGC TTCTGGCTAC TCCTTCTCGA GTCACTATAT 200
GCACTGGGTC CGTCAGGCCC CGGGTAAGGG CCTGGAATGG GTTGGATATA 250
TTGATCCTTC CAATGGTGAA ACTACGTATA ATCAAAAGTT CAAGGGCCGT 300
TTCACTTTAT CTCGCGACAA CTCCAAAAAC ACAGCATACC TGCAGATGAA 350
CAGCCTGCGT GCTGAGGACA CTGCCGTCTA TTACTGTGCA AGAGGGGATT 400
ATCGCTACAA TGGTGACTGG TTCTTCGACG TCTGGGGTCA AGGAACCCTG 450
GTCACCGTCT CCTCGGCCTC CACCAAGGGC CCATCGGTCT TCCCCCTGGC 500
ACCCTCCTCC AAGAGCACCT CTGGGGGCAC AGCGGCCCTG GGCTGCCTGG 550
TCAAGGACTA CTTCCCCGAA CCGGTGACGG TGTCGTGGAA CTCAGGCGCC 600
CTGACCAGCG GCGTGCACAC CTTCCCGGCT GTCCTACAGT CCTCAGGACT 650
CTACTCCCTC AGCAGCGTGG TGACCGTGCC CTCCAGCAGC TTGGGCACCC 700
AGACCTACAT CTGCAACGTG AATCACAAGC CCAGCAACAC CAAGGTCGAC 750
AAGAAAGTTG AGCCCAAATC TTGTGACAAA ACTCACACAT GCCCGCCTGA 800
(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 256 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

Met Lys Lys Asn Ile Ala Phe Leu Leu Ala Ser Met Phe Val Phe
1 5 10 15

Ser Ile Ala Thr Asn Ala Tyr Ala Glu Val Gln Leu Val Gln Ser
20 25 30

Gly Gly Gly Leu Val Gln Pro Gly Gly Ser Leu Arg Leu Ser Cys
35 40 45

Ala Ala Ser Gly Tyr Ser Phe Ser Ser His Tyr Met His Trp Val
50 55 60

Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val Gly Tyr Ile Asp
65 70 75

Pro Ser Asn Gly Glu Thr Thr Tyr Asn Gln Lys Phe Lys Gly Arg
80 85 90

Phe Thr Leu Ser Arg Asp Asn Ser Lys Asn Thr Ala Tyr Leu Gln
95 100 105

Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Ala
110 115 120

Arg Gly Asp Tyr Arg Tyr Asn Gly Asp Trp Phe Phe Asp Val Trp
125 130 135

Gly Gln Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly
140 145 150

Pro Ser Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly
155 160 165

Gly Thr Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu
170 175 180

Pro Val Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val
185 190 195

His Thr Phe Pro Ala Val Leu Gln Ser Ser Gly Leu Tyr Ser Leu
200 205 210

Ser Ser Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gln Thr
215 220 225

Tyr Ile Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp
230 235 240

Lys Lys Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr Cys Pro
245 250 255

Pro
256

(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 452 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

Glu Val Gln Leu Val Gln Ser Gly Gly Gly Leu Val Gln Pro Gly
1 5 10 15

Gly Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Tyr Ser Phe Ser
20 25 30

Ser His Tyr Met His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu
35 40 45

Glu Trp Val Gly Tyr Ile Asp Pro Ser Asn Gly Glu Thr Thr Tyr
50 55 60

Asn Gln Lys Phe Lys Gly Arg Phe Thr Leu Ser Arg Asp Asn Ser
65 70 75

Lys Asn Thr Ala Tyr Leu Gln Met Asn Ser Leu Arg Ala Glu Asp
80 85 90

Thr Ala Val Tyr Tyr Cys Ala Arg Gly Asp Tyr Arg Tyr Asn Gly
95 100 105

Asp Trp Phe Phe Asp Val Trp Gly Gln Gly Thr Leu Val Thr Val
110 115 120

Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro
125 130 135

Ser Ser Lys Ser Thr Ser Gly Gly Thr Ala Ala Leu Gly Cys Leu
140 145 150

Val Lys Asp Tyr Phe Pro Glu Pro Val Thr Val Ser Trp Asn Ser
155 160 165

Gly Ala Leu Thr Ser Gly Val His Thr Phe Pro Ala Val Leu Gln
170 175 180

Ser Ser Gly Leu Tyr Ser Leu Ser Ser Val Val Thr Val Pro Ser
185 190 195

Ser Ser Leu Gly Thr Gln Thr Tyr Ile Cys Asn Val Asn His Lys
200 205 210

Pro Ser Asn Thr Lys Val Asp Lys Lys Val Glu Pro Lys Ser Cys
215 220 225

Asp Lys Thr His Thr Cys Pro Pro Cys Pro Ala Pro Glu Leu Leu
230 235 240

Gly Gly Pro Ser Val Phe Leu Phe Pro Pro Lys Pro Lys Asp Thr
245 250 255

Leu Met Ile Ser Arg Thr Pro Glu Val Thr Cys Val Val Val Asp
260 265 270

Val Ser His Glu Asp Pro Glu Val Lys Phe Asn Trp Tyr Val Asp
275 280 285

Gly Val Glu Val His Asn Ala Lys Thr Lys Pro Arg Glu Glu Gln
290 295 300

Tyr Asn Ser Thr Tyr Arg Val Val Ser Val Leu Thr Val Leu His
305 310 315

Gln Asp Trp Leu Asn Gly Lys Glu Tyr Lys Cys Lys Val Ser Asn
320 325 330

Lys Ala Leu Pro Ala Pro Ile Glu Lys Thr Ile Ser Lys Ala Lys
335 340 345

Gly Gln Pro Arg Glu Pro Gln Val Tyr Thr Leu Pro Pro Ser Arg
350 355 360

Glu Glu Met Thr Lys Asn Gln Val Ser Leu Thr Cys Leu Val Lys
365 370 375

Gly Phe Tyr Pro Ser Asp Ile Ala Val Glu Trp Glu Ser Asn Gly
380 385 390

Gln Pro Glu Asn Asn Tyr Lys Thr Thr Pro Pro Val Leu Asp Ser
395 400 405

Asp Gly Ser Phe Phe Leu Tyr Ser Lys Leu Thr Val Asp Lys Ser
410 415 420

Arg Trp Gln Gln Gly Asn Val Phe Ser Cys Ser Val Met His Glu
425 430 435

Ala Leu His Asn His Tyr Thr Gln Lys Ser Leu Ser Leu Ser Pro
440 445 450

Gly Lys
452



(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 219 amino acids 

(B) TYPE: Amino Acid 
(D) TOPOLOGY: Linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

Asp Ile Gln Met Thr Gln Ser Pro Ser Ser Leu Ser Ala Ser Val
1 5 10 15

Gly Asp Arg Val Thr Ile Thr Cys Arg Ser Ser Gln Ser Leu Val
20 25 30

His Gly Ile Gly Ala Thr Tyr Leu His Trp Tyr Gln Gln Lys Pro
35 40 45

Gly Lys Ala Pro Lys Leu Leu Ile Tyr Lys Val Ser Asn Arg Phe
50 55 60

Ser Gly Val Pro Ser Arg Phe Ser Gly Ser Gly Ser Gly Thr Asp
65 70 75

Phe Thr Leu Thr Ile Ser Ser Leu Gln Pro Glu Asp Phe Ala Thr
80 85 90

Tyr Tyr Cys Ser Gln Ser Thr His Val Pro Leu Thr Phe Gly Gln
95 100 105

Gly Thr Lys Val Glu Ile Lys Arg Thr Val Ala Ala Pro Ser Val
110 115 120

Phe Ile Phe Pro Pro Ser Asp Glu Gln Leu Lys Ser Gly Thr Ala
125 130 135

Ser Val Val Cys Leu Leu Asn Asn Phe Tyr Pro Arg Glu Ala Lys
140 145 150

Val Gln Trp Lys Val Asp Asn Ala Leu Gln Ser Gly Asn Ser Gln
155 160 165

Glu Ser Val Thr Glu Gln Asp Ser Lys Asp Ser Thr Tyr Ser Leu
170 175 180

Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu Lys His Lys
185 190 195

Val Tyr Ala Cys Glu Val Thr His Gln Gly Leu Ser Ser Pro Val
200 205 210

Thr Lys Ser Phe Asn Arg Gly Glu Cys
215 219






